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^ Abstract 

r^. Leptonic decays of the Higgs boson in the ZZ^*") channel yield what is known as the golden 

channel due to its clean signature and good total invariant mass resolution. In addition, the full 

> 

kinematic distribution of the decay products can be reconstructed, which, nonetheless, is not taken 



(N 
(N 



into account in traditional search strategy relying only on measurements of the total invariant 



00 mass. In this work we implement a type of multivariate analysis known as the matrix element 
O 

^ method, which exploits differences in the full production and decay matrix elements between the 

^ Higgs boson and the dominant irreducible background from qq ZZ^*\ Analytic expressions 

^ of the differential distributions for both the signal and the background are also presented. We 

H 

perform a study for the Large Hadron Collider at ^/s = 7 TeV for Higgs masses between 175 and 
350 GeV. We find that, with an integrated luminosity of 2.5 fb~^ or higher, improvements in the 
order of 10 — 20% could be obtained for both discovery significance and exclusion limits in the high 
mass region, where the differences in the angular correlations between signal and background are 
most pronounced. 
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I. INTRODUCTION 



The discovery of the Higgs boson [T] would be the triumphant culmination of the exper- 
imental quest to discover the particles of the Standard Model. The Tevatron has already 
set interesting limits on the Standard Model (SM) Higgs boson in the intermediate mass 
range [2]. At the Large Hadron Collider (LHC), with data corresponding to roughly 1 fb^^ of 
integrated luminosity, the ATLAS collaboration has announced exclusions at 95% confidence 
level of Higgs masses in the ranges 155 — 190 GeV and 295 — 450 GeV j3], while the CMS 
collaboration's limits are in the ranges 149 — 206 GeV and 300 — 440 GeV [1] . High mass 
limits from both collaborations are driven by measurements in the ZZ^*'' channel, which is 
considered the main discovery channel of the Higgs boson for masses above 200 GeV. 

Among the different decay products of the Higgs into ZZ^*'> bosons, the one with both Z 
bosons decaying into e'^e" or fi'^fi"^ is often referred to as "the golden channel" because of 
the good invariant mass resolution and well-controlled background. The traditional search 
strategy using the golden channel thus focuses on measuring the invariant mass spectrum 
of the four leptons. However, given that four-momenta of all decay products can be recon- 
structed with sufficient resolution, it is possible to measure more than just the total invariant 
mass of the four leptons. In fact, there are a total of five angles (and two additional in- 
variant masses, those of the off-shell Z bosons) that can be measured. Obviously it would 
be advantageous to incorporate all available kinematic information when searching for the 
Higgs boson. 

Additional kinematic variables can be included in an experimental measurement by mul- 
tivariate analyses |5] , which already have a wide range of applications in many measurements 
done at the Tevatron and the B factories. Several multivariate methods have been employed, 
such as neural nets; boosted decision trees; and the Matrix Element Method (MEM) the 
best-known use of which has been in studying the top quark at the Tevatron [7j. Even in 
some of the LHC Higgs analyses, most notably leptonic decays of Higgs to W^W~ where 
there are two missing neutrinos, multivariate analyses, in particular boosted decision trees, 
have been used to incorporate additional kinematic observables such as the opening angle 
between the two charged leptons in the final states [8]. On the other hand, it is somewhat 
surprising that in the golden channel, where there is no missing particle in the final state and 
all angles can be reconstructed, no experimental analysis that we are aware of has considered 
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supplementing total invariant mass with angular correlations to search for the Higgs boson. 
(For recent analyses, see 

Angular correlations of Higgs decays in the golden channel have been studied previously, 
to determine the spin and CP properties of the putative Higgs resonance [101 [ll] • A par- 
ticularly useful observable, the azimuthal angle between the decay planes of the two Z 
bosons, was pointed out recently in Refs. [121 US] • Subsequently, two comprehensive studies 
appeared in Refs. |lH |T5]. These works included the computation of the angular correla- 
tions of the final state leptons resulting from the production of a resonance (with arbitrary 
spin less than or equal to two) which in turn decays, via general couplings, to a pair of Z 
bosons, which subsequently decay leptonically. Both analyses also implemented the MEM 
in this ZZ^*^ — )• M channel, to distinguish between various hypotheses for the spin and CP 
properties of the putative Higgs signal at the LHC with ^/s = 14 and 10 TeV, respectively. 
Moreover, Ref. [15] briefly discussed using the MEM to enhance the Higgs discovery reach in 
the golden channel at 10 TeV for two specific values of the Higgs mass (200 and 350 GeV). 

In the present work, instead of comparing angular correlations for different spin and CP 
assumptions for a singly produced resonance, we aim at distinguishing the SM Higgs boson 
signal from the dominant irreducible background qq — t- ZZ^*^ — )► 4£ using the MEM at the 
LHC with y/s = 7 TeV. The scattering amplitude for qq — >■ ZZ^*^ — )■ 4£, when both Z bosons 
are on-shell, has been computed long ago in Refs. [161 HZ]- We extend the calculation of 
Ref. [TT] to off-shell Z bosons and present analytic expressions for the fully differential cross 
section. Then we perform a Monte Carlo study, implementing the MEM to determine the 
improvement in sensitivity for Higgs boson searches in the golden channel over a significant 
range of Higgs masses (between 175 and 350 GeV). We perform these analyses for several 
integrated luminosities, between 1 and 7.5 fb~^. For simplicity we consider only the 0-jet 
bin and assume that events have no intrinsic pt, though we expect the qualitative features 
of our results to be more generally apphcable. 

This work is organized as follows: in Sect. [IT] we introduce and define kinematic variables 
to be used in the fully differential cross sections. In particular, Lorentz-invariant expressions 
for all production and decay angles are presented, so that the kinematic distributions can 
be reconstructed using measurements done in the laboratory frame. In Sect. HI we com- 



pute the amplitude and cross section for qq — )■ ZZ^*'^ — M using the technique of helicity 



amplitudes introduced in Ref. [TB], allowing both Z bosons to be off-shell. In Sect. IV the 
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MEM is briefly reviewed, as well as the relevant statistical procedures we employed. The 
Monte Carlo study, including event generation, detector smearing effects, and construction 
of pseudo-experiments is discussed in Sect. |Vl Then we present our results for both expected 



signiflcance and exclusion limits in Sect. VI Finally, we close with conclusions in Sect. VII 



II. KINEMATICS 



As noted above, in this study we consider events in which two Z bosons are produced, 
either from the decay of a SM Higgs Boson produced in the gluon fusion channel or from 
t(n)-channel qq production. Each Z boson, which could be either on or off the mass shell, 
decays to a lepton [It) and an anti-lepton (£). We do not consider events with additional 
particles in the final state; thus the transverse momentum of the 4£ system is assumed to 
be negligible. In other words, we only consider exclusive ZZ^*^ — )■ M processes. 

In these events, the final state can be completely reconstructed. In general the kinematics 
can be specified in terms of two production angles of the ZZ^*'' system, one of which is 
irrelevant; four decay angles describing ZZ^*^ — )■ 4£; and the invariant masses of the Z's. In 
hadron colliders it is also necessary to know the momentum fractions of the initial massless 
partons, Xi, and X2, in order to compute the differential cross sections. We now describe our 
convention for the angles which specify the event and how to obtain them for a particular 
event. In particular, we provide Lorentz-invariant definitions of all angles, allowing for their 
determination from four-momenta reconstructed in the laboratory (Lab) frame. 



A. Definition of angles 

Let pi and p2 be the momenta of the lepton pair coming from Zi, and and be 
the momenta of the lepton pair from Z2, while ki^2 are the momenta of Zi 2. Our notation 
is such that pi = ii, p2 = ii, Ps = £2, Pi = i-e. Pi is the momentum of the lepton 
from Zi decay, p2 the momentum of the antilepton from Zi decay, etc. We denote the 
momenta of the incoming partons by kg and kg. The total momentum of the ZZ^*^ system 
is P = kg + kg = ki + k2 = Pi + P2 + Ps + Pa, which satisfies P'^ = s = M^. For Higgs 
production in the gluon fusion channel, the incoming partons are self-conjugate, kg = A;giuon,i, 
kq = fcgiuon,2, and the total momentum P is the Higgs momentum. 
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FIG. 1: (a) Two decay planes of Zi — )■ iiii, i = 1,2. The polar angles 9i shown are defined in the 
rest frames of Zi with respect to ki, while the azimuthal angles shown are in fact 27r — </>i = —(pi 
and IT — (f)2- (b) The coordinate system in the CM frame and the definition of the production angle 

e. 

As indicated in Fig. [l| we choose the coordinate system in the center-of-mass (CM) frame 
of the two Z's system as: 



ZCM = h 



ycM 



XCM = yCM X ZcM 



-kq + ki{kq ■ ki) 



(1) 



\kq X /c]^ I ^ ^1 1 

Furthermore, we define Zi as the rest frame of the Zi boson by boosting the CM frame 
along /ci, while Z2 is obtained by first rotating CM frame with respect to ijcM by tt and then 
boosting along k2. The production angle 9 and decay angles {6*1,^^2,01,02} are defined as 
follows: 

• B: polar angle of the momentum of the incoming quark in the CM frame. 

• di 2'- polar angle of the momentum of £1^2 in the Z12 frame. 

• 01^2^ azimuthal angle of £1^2 in the Z12 frame. 

The azimuthal production angle is irrelevant and chosen to be zero. In these definitions, 
three-momenta of ^l 2 in the Zi 2 frame can be written as 



P4 in the Z^ frame = jp^J (sin cos 0i, sin sin 0j, cos 6',) , i = 1,2 , 



(2) 



while the three-momentum of the incoming parton in the CM frame is 



kq in the CM frame = \kq\ (— sin 0, 0, cos O) . 



(3) 



In hadron colliders the CM frame of the two Z's system is different from the Lab frame 
and the event as a whole will be boosted along the beam axis with respect to the Lab 
frame, P = (P°,0,0,P^). Also, we have chosen to define the coordinate system in the 
CM frame such that the z axis is defined by the Zi three-momentum, rather than by the 
three-momentum of the incident partons, as is natural in the Lab frame. 

The total energy and momentum of the event P in the Lab frame can be used to de- 
termine the momentum fractions of the incident partons. Following Ref. [19], we write 
kg = Xi{Ecm, 0, 0, Ecm) and kg = X2iEcm, 0, 0, -Ecm), where E^m = y/s/2 is the CM energy 
of the colliding protons. From P = kg + kg we see that s = X1X2S and 

kg = ^(P° + P^O,0,P° + P^) , (4) 
kg = i(pO-P^O,0,P^-P°) , (5) 

which are valid in the Lab frame. 



B. Lorentz-invariant construction of angles 

— * — * — * — * 

In the CM frame, ki and ^2 are back to back and of equal magnitude, as are kg and kg. 
Using P = M(l, 0, 0, 0) we can work out the energy and three-momentum of the incoming 
partons. 



Eg = Eg = \kg\ = \kg\ = ^ , 



as well as that of the two Z's, 



^. = ^, \^^\ = \ [-^) -"^?2 = Az, . = 1,2, (7) 



where we define mf^ = {pi + PjY = 2pi ■ pj. Alternatively, \z = y {P ■ k2/M) — m'^^. 
Since cos Q = kg ■ ki, hj computing kg ■ ki it is simple to derive 

1^,1 l^il M \z ' 

By definition cos0 changes sign under kg -H- kg, which is manifest in Eq. (j8|. Thus when 
the direction of the incoming quark cannot be distinguished from the anti-quark, as is the 
case for hadron colliders, or when the incoming partons are self-conjugate as in the Higgs 
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production channel we consider, one can only determine cos up to a minus sign. Because 
6 is only defined between and vr, it is not necessary to compute sin 9. 

Next we consider 6'j, which was worked out in the appendix of Ref. |T3] . Since ki = pi+P2, 
in the CM frame we can write ki = {Ei + i?2,0,0, \pi + p2\) and solve for the boost that 
takes ki from the CM frame to the rest frame of Zi where it is (mi2, 0, 0, 0): 




7 -7/3 \ / El + E2 



(9) 



from which we get 



/;^f±f. (10) 

The inverse boost would then take pi = (mi2/2)(l, sin^i cos0i, sin6'i sin0i, cos^i) in the Zi 
rest frame to pi = {Ei,pi) in the CM frame, implying the following relation: 

Ei = 7^(l + /3cos^^i) , (11) 



from which we obtain 



Using Eq. ([t]) we arrive at 



E-\ — Eo E-\ — En , . 

cos^i = ^ ^ = — ^. (12) 



cos^i = ^P-(pi-p2) . (13) 



For cos 6^2, simply replace p\ and p2 by ^3 and p4, respectively, and we obtain 

1 

cos ^2 = ■ (P3 - P4) • (14) 

To compute 0j, we first construct the unit normal vectors of the two decay planes, 

,7 Px X p2 7 n X P4 

\P\ X P2\ \PZ X P4 



SO that 



A^i ■ ^CM = sin 01 , iVi ■ ycAf = - cos 0i , (16) 
A^2 ■ xcM = - sin 02 , ^^2 ■ ycM = - cos 02 . (17) 



The numerator of the normal vector can be written as 



(pi X p^r = j^^'""'>^,p,.Pp ^ j^^'-''"'' ' (18) 



where eoi23 = — e*^^^^ = 1. On the other hand, the Lorentz- invariant form of the numerator 
can be obtained using the relations 



\Pi 



1 



Pi - P , cos 9i 



m 



(19) 



where 9ij is the opening angle between pi and pj in the CM frame, so that in the end we 



have 

One then calculates 



Pi-Ppj-P _ rn4^ 
MM 4 



K. 



V ■ 



sm</)i 



coscpi 



cPlPikqP 



|pi X sin 9 M'^Ki2Sva.Q 
Pi X p2 kg X ki 2 



\Pi X P2I |L X A:i| M^KisAzsinO 



pi - kg pi-ki Pi - P 

P2 - kg P2-ki P2- P 

P-kg P-ki M2 



and similarly 



sm (p2 



cos (p2 



M'^Ku sin e 
2 

M^Ks^Xz sin 9 



P3 - kg p3- ki ps- P 
Pa - kg pi-ki Pa- P 
P-kg P-ki M2 



(20) 



(21) 



(22) 



(23) 



(24) 



It is also worth noting that when kg ^ —kg, (pi ^ ti + (pi. So in hadron colliders or gluon fu- 
sion production we cannot distinguish between an event described by angles (9, 61, 62, 0i, ^2) 
and an event described by angles (tt — 9, 9i, 62, (pi + vr, 02 + tt). 



III. DIFFERENTIAL CROSS SECTIONS 

Angular distributions in ZZ'^*'> — > A^ provide a wealth of information on the production 
mechanism of the two Z bosons [TTHTHj 120] ■ Similar angular correlations in the vector boson 
fusion channel of Higgs production have also been discussed in Ref. [21] • As noted above, in 
this work we focus on the search of the Higgs boson in the golden channel, h ZZ^*'' — )■ 4£, 
and study the possibility of differentiating the Higgs signal from the dominant irreducible 
background qq — t- ZZ^*^ — )■ 4£ using spin correlations. In particular, we will compute the 
amplitudes in a helicity basis following Ref. [T7] . 
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FIG. 2: Feynman diagrams contributing to qq — t- M. We only consider (a), since final states from 
(b) have a total invariant mass at the Z mass. 

We will present the expressions for the fully differential cross section for both the signal 
and the background. Results for the Higgs production and decay have appeared in many 
previous works (see, for example, Refs. [HI [T3]4T5| 122] ) and are not new. They are given 
here for completeness. Earlier works on qq — )■ ZZ^*"* — )■ Ai include [lEl IT7j . 

Feynman diagrams contributing to qq — )■ 4£ are shown in Fig. [2j We consider only the 
diagram in (a) and the corresponding u-channel diagram, as we are interested in final states 
with a total invariant mass much larger than the Z mass. We will compute the amplitude 
of the diagram in Fig. [2] (a) and its w-channel partner in a helicity basis, following Ref . [T7] . 
The amplitude for the process under consideration factorizes into one production and two 
decay amplitudes: 

q{kq,a) + q{kg,a) — > Zi{ki, Xi) + Z2{k2, X2) , (25) 
Zi(A;i,Ai) ^ ^i(pi,ai)+£i(p2,cT2) , (26) 

Z2{k2, A2) ^ i2{p3, CT3) + i2{P4, CT^) , (27) 

where the momentum and helicity of each particle are indicated. A similar factorization 
obtains in the case of the gg h ^ ZZ^*^ M signal. As the production and decay 
amplitudes factorize, we will consider them separately. 

Before going into details of the computation, we state here the conventions we choose 
for various kinematic vectors and fermion spinors. We use the explicit expressions for the 
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four-momenta and polarization vectors of the Z bosons in the CM frame: 

= mi7i(l,0,0,/3i) , 
m272(l, 0,0,-/32) , 



k2 



V2 
1 

71 



(0,^1,^,0), ei' 



(0) 



(0, ±1,^,0), e 



(0) 



7i(/3i, 0,0,1) , 
72(/32, 0,0,-1) 



(28) 
(29) 
(30) 

(31) 



where kf 

the boost factors are 

1 y/s 

71 



'm^,i = 1,2 is the invariant mass of Zt, which could be off the mass shell, and 



:i + x) 



72 



X 



We will also use the explicit forms of u{p, A) and v{p, A) spinors: 
/ \ / \ fV2E\ 



X 



1 2 

m\ — 1712 



(32) 





/2E 

\ y 



Ml 



/2E 


V y 






V y 



/ \ 






(33) 



where E is the energy (or momentum) of the massless lepton. In these expressions, u{p, A) 
have been defined for p in the z direction and f (p. A) have been defined for p in the — z 
direction. Our conventions here are those in [171 HH] ; these were chosen so that our qq — )■ 
ZZ^*"* helicity amplitudes would reduce to those in Ref. [17] in the limit where the Z bosons 
are on shell. 



A. Production amplitudes for the signal 

Because the Higgs is a scalar particle, the two Z bosons can only have the following three 
helicity combinations: (0, 0) and (±1, ±1). Since the gluon-gluon- Higgs coupling is given by 

-hG.^G^" , (34) 



127rt; 

tZZ 



where v = 246 GeV is the Higgs vev, the production helicity amphtude M.h-XiX2 99 ~^ 
h — )■ ZZ'^*^ can be written as 

2 " 
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This is the amphtude for a particular spin and color configuration for the initial gluons. In 
the interest of clarity, we do not write the gluon helicities in the above amplitude, however 

these amplitudes should be taken as the amplitude for the ++ or initial state gluon 

helicities. For the other two helicity combinations, the amplitude vanishes. We will average 
the squared sum of these amplitudes over spin and color when finding the differential cross 



section. It is worth pointing out that, as one can easily see from Eq. (35), in the high energy 
limit when the Higgs is heavy, the boost factor 7^1 and the amplitude for two longitudinal 
Z bosons, (Ai, A2) = (0, 0), dominate over those for the transverse Zs. 



B. Production amplitudes for the background 



The production helicity amplitude for qq — )■ Z1Z2 in the CM frame reads [17j 



(37) 



'''^'4/3i/32sin2 e + (1 - /3i/32)2 - ^^(l + ^^^^f 
In the above Aa = cr — a, e = Ao"(— l)'^^, AA = Ai — A2, and Jq = max(|Acr|, |AA|). Note that 



in the limit of massless quarks, which we consider in this work, the amplitude in Eq. (37) 



vanishes for Act = 0. The left- and right-handed coupling of quarks to the Z boson are given 
by: 

(38) 



zqq leKTs - Qsin Q^) 
9aI = 



sin 9w cos 9w 

where T3 = for right-handed quarks. Furthermore, c^Ao-aaI®) ^ function in the 

convention of the Particle Data Group 



The coefficients Ax^x2 



AA = ±2 
AA = ±1 



A. 



Act 
±0 



V2{1 + P,f3,) , 

^[,A.AA)(l.^^)-2cose 



X 



(39) 



(40) 



A: 



Aa 
0± 



7i( 



[( AaAA) ( 1 + ^1±S] _ 2 cos e 

1 - x) [ V 2 y 



AA = 
AA = 



-{AaAX){Pl - Pl)x + 2xcose - {AaA\)(^ - ^1±^ 
^±± = -(1 - /3i/32) cose - AiAa(l + /3i/32)x , 



X 



^^0" = 27172 cose 



((l-x)/3i + (l + x)/32) 



/3i/32 



x^ 



(41) 
(42) 

• (43) 
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It can be checked that in the on-shell hmit when x — )■ 0, /3i,/32 — ^ and 71,72 7, 
the above expressions reduce to those presented in Appendix D of Ref. [T7]. Moreover, in 
the high energy hmit when 7 3> 1, the AA = ±2 amphtudes dominate, corresponding to 
(Ai,A2) = (±,T). 



C. Amplitudes for Z boson decay to leptons 

The decay hehcity amphtude for the 1 — )■ 2 process such a.s Z ii is essentially a matrix 
element of the spin-1 rotation matrix, which is the inner product between states with definite 
projections of angular momenta along a chosen axis We have defined our angles, in 
particular 9i, 62, 0i, and 02, such that we need consider only Zi{\) — )• ia-{9i, (pi)!^. That 
is, the amplitude has the same form for i = 1 and for i = 2. Specifically, following the 



conventions described above in Eq. (28) and (33), we find that 

■^it.,.. = Aa, (-1)^' gif d(Aa„ A„ 6,) e^^»'^% (44) 

where d{Aai, Aj, 6i) = (f^^^ l^^A' l^A ^(^Q.'^ in the conventions of Ref. [23] • In the above, rrii is 
the invariant mass of Zi, Aj is its helicity, AcTj = cxj — (Xj, and the coupling of the Z to the 
lepton pair is 



zee _ |e|(^3 + sin 9w; 
sm Uw cos &w 



where T3 = —1/2 (0) for left (right)-handed leptons. Note that the amplitude in Eq. (44) 



vanishes for Ao"j = in the limit of massless leptons, which is the limit taken in this work. 



For AcTj = —1, the expression in Eq. (44) reproduces Eq. (4.8a) in Ref. [TT]. 



D. Differential cross sections for signal and background 
The full amplitude of qq — )■ Z1Z2 — ?■ (^1^1) (^2^2) is then 

Mia, a; a,) = ^'J-MX, Dz{kl)Dz{kl) M^^l^.M^,.^,^ , (46) 



Ai ,A 



where the propagator factor for the Z boson may be taken to be 



Dz{q') = -2 2 V r • (4^) 

q'' — m| + ^mz^z 
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The full differential cross section is then (using the short-hand notation Q = 
{6, 9i, 02, 01, 02} and dQ = dcos 9 rfcos 9i rfcos 02 d(f)i d(j)2) 

m^r^^lln^l^K^^) [y.) (48) 

where 2tt comes from integrating over the unobservable production azimuthal angle, 1/4 
from averaging over initial spin states, 1/3 from averaging initial color states, l/(2s) from 
the incoming flux, /3i(l + a;)/(327r^) from the ZZ^*"* two-body phase space, l/(327r^) from 
the phase space of each of the ^^ final states, and l/(27r) from each drn^ integral that one 
obtains when re-expressing the four particle phase space in terms of the drnf variables (see 
for example [23]). 

Likewise, in the case of the gg h Z1Z2 — > (^1^1) (^2^2) signal, the full amplitude is 
M{h- a,) = Mil,^ Dz{kl)Dz{kl) -MI^U.-.-^W..-. ' (49) 

and the full differential cross section is 

dcy 111 /3i(l + a;) /l\Vl\^/\v^,..., ^,2 



dndm{dmi 4 8 2s 327r2 \?,2-n'^ J \2tx 



Note that the only significant difference between Eq. (48) and Eq. (50) is that color averaging 



yields a factor of 1 in Eq. (48), but | in Eq. (50). The factor of 2 before the sum in Eq. (50) 
is from summing over initial gluon helicities. 

Before proceeding, we note our definitions of Zi and Z2. When the two Z bosons decay 
into 2e2/i final states, we define Zi to be the parent particle of the electron/positron pair, 
while for 4e or 4/i final states, Zi is the Z with lower invariant mass. Technically, one 
could reconstruct e.g. a 4/i final state in two ways (in terms of assigning a particular muon 
and a particular antimuon to a given Z), however generally one of these reconstructions 
would make the resulting Zs far off-shell. Hence we only keep the reconstruction for which 
the product of the Z Breit-Wigner factors is largest. We note that the total event rate of 
^^{*) _^ 2e2/i is a factor of two larger than that of ZZ^*^ 4e or ZZ^*^ 4/i. 



E. Angular distributions 



Analytic expressions for the background and signal distributions are contained in Eqs. (48 ) 
and (50). However, they are too long to be presented explicitly and not very illuminating. 
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FIG. 3: Signal and background singly differential distributions atnih = ^/s = 220 and 350 GeV. The 
blue (dashed) lines are background distributions and the red (solid) lines are signal distributions. 

Instead we will show figures for all the singly and doubly distributions for the signal and 
the background in this subsection. 

In Fig. |3]we show normalized singly differential distributions in (pi, $ = vr — 0i — 02, 0, 
and $1 for the processes uu — )■ ZZ Ai and h ZZ — )■ 4i at ruh = y/s = 220 and 350 
GeV. (For simplicity, we take the Z bosons to be on-shell.) One notable feature is that the 
signal (pi distribution is flat, although there are correlations between (pi and (p2 such that $ 
exhibits a cos 2$ dependence. It was pointed out in Refs. [121 [13] that this observable is very 
useful in discerning the spin and CP properties of a singly produced resonance decaying to 
two Z bosons. In addition, distributions in the production angle G indicate that the ZZ pair 
produced by the background process tend to be in the forward region inside the detector; 
this is especially pronounced when the invariant mass is high. This is a feature of the t- 
channel production mechanism which has been suggested as a way to tag the vector boson 
fusion production of the Higgs boson, where there are two forward jets in the event. In the 
signal case, on the contrary, the Zs are produced isotropically, as expected from the fact 
that the Higgs is a scalar particle. 

It is also interesting to consider doubly differential angular distributions for signal and 
background, which are shown in Fig. [4} We plot distributions in the following pairs of angles: 
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Bkgd do-/N dcos dcos €i 
-220 GeV 




Bkgd dcr/N dcos dcos 6i 
V7 = 350 GeV 




Signal dcr/N dcos dcos 6i 



Vs" = 350 GeV 





Bkgd dtr/N dcos 0i di^, Bkgd dtr/N dcos fli d<^, Signal dcr/N dcos fli di^i 

iA7 = 220 GeV V7 = 3S0GcV V7 = 350 GeV 




Bkgd do-/N dcos fli dcos 



V7 = 220 GeV 




Bkgd dcr/N dcos fli dcos fli 
V7 = 350 GeV 




Signal dtr/N dcos dcos 62 
V7 = 350 GeV 




Bkdg d(r/N d^, d^j Bkdg dcr/N d^ij d^l^ Signal dcr/N d,^, dijj 

V"j =220GeV V7 = 350GeV Vs = 350 GeV 




FIG. 4: Doubly differential distributions for signal and background. See the text for more explana- 
tions. 

15 



(©,^i), (©,0i), {9i,4>i), {01,62), and (0i,02)- The background distributions are again from 
uu initial states and shown for ^/s — 220 and 350 GeV. We observe noticeable changes in 
the distributions of the first three pairs of variables for these two different center-of-mass 

energies. On the other hand, the signal distributions do not change as much when one varies 
the Higgs mass, except for the (0i, 02 ) distributions, and are shown only for ruh = 350 GeV. 
The background exhibits strong correlations between the pairs of angles in all five cases. 

IV. STATISTICAL PROCEDURES 

Likehhood methods are frequently employed to establish the presence, or lack, of a signal 
using kinematic distributions which discriminate signal from background. For example, if 
the purpose is discovery, one considers the "null hypothesis" assuming the observed data 
set is entirely due to background and an "alternative hypothesis" assuming the presence of 
both signal and background. A likelihood function is defined for each hypothesis to quantify 
the probability of obtaining the actual data under that particular hypothesis. In order to 
accept or reject one hypothesis in favor of the other, a "test statistic" must also be defined. 

In quantum field theory there is a natural object quantifying the probability of obtaining 
a particular event in a given set of data: the differential cross section. This motivates the 
use of the MEM, which is simply the use of likelihood methods where the probability density 
function (pdf) used in the likelihood is the properly normalized differential cross section (or 
"matrix element") with respect to certain kinematic variables^. When the number of events 
is small one can include every event in the evaluation of the likelihood. This "unbinned 
hkehhood method" is what we adopt in the following. 

A. Extended maximum likelihood method 

When there exist free parameters in the underlying hypothesis one wishes to test^, statis- 
tically preferred values of the parameters of the underlying model are found by maximizing 
the likelihood function with respect to those parameters. When the overall number of events 
is not fixed but allowed to fiuctuate, the normalization of the likelihood function may become 

^ We use "pdf" for probability density function and "PDF" for parton distribution function. 

^ In such cases the hypothesis is called "composite" , while those without free parameters are called "simple" . 
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a free parameter. In this case, one is using the "extended" maximum hkehhood method [25] . 
which we employ in this work. 

To be more specific, we consider the hkehhood for some coUider signature with (unknown) 
expected number of events fi described by kinematic information x = {xi,X2, ...xn}, where 
the Xi are kinematic variables describing event i of N total events. The unbinned likelihood 
is simply 

c{ii;e) = ^-^l[p{e-x.), (51) 

1=1 

where P{0; x) is the pdf for the kinematic variables as a function of 0, the parameters of the 
underlying model. In the MEM, P{6,x) is the differential cross section normalized by the 
total cross section, generally scaled by efficiencies and acceptances of the detector involved. 

We will define Ps{mh,Xi) to be the normalized pdf for the signal process, which depends 
on one underlying parameter, the Higgs mass rrih-^ The normalized pdf for the background 
process is then given by Pf){xi)] it is assumed here that the differential cross section for the 
background process is known and contains no unknown parameters. The likelihood function 
for the signal plus background hypothesis is then given by 

Cs+b{fi, f, nih) = \[[fPs{mh; X,) + (1 - m{xi)] , (52) 

i=l 

where fi = + fJ'b is the sum of the expected number of signal events fig and expected 
number of background events fib, while the signal fractional yield / is defined as 

< / = — ^ < 1 . (53) 

We often refer to / simply as the "yield". For Cb, the likelihood for the background-only 
hypothesis, we simply set fig = 0. When computing the likelihood for each hypothesis, we 
calculate the maximum of Cg+b in {z^, ^h, /} and Cb in /i, respectively''. In practice, since 
the Poisson distribution factors in the definition of the likelihood function, the maximum of 
the likelihood always occurs aX fi = N. So effectively one can replace fi by N, the measured 
total number of events, in the calculation. 



^ We set the Higgs width to the SM value for the given Higgs mass, m/j, as found by HDECAY 
^ In our fitting procedure, we fook for the focal maximum closest to the true Higgs mass. 
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B. Expected significance 



In determining the significance for Higgs discovery from a set of events, we are really 
comparing the likelihood of two hypotheses: (i) that the events consist of signal events from 
the Higgs boson at some mass, as well as events from background qq — ?■ ZZ^*'' production, 
and (ii) that all the events are due to qq — )■ ZZ^*^ production. Our choice of test statistic 
for describing this relative likelihood is the likelihood ratio 

Q = ^, (54) 

from which the significance of discovery is computed 

S = V2 InQ . (55) 

This test statistic was used by the LEP experiments in their Higgs searches [27], while its use 
for h ZZ^*^ ^ 4/i was studied in Ref . |2H] ■ In our case we include the angular correlations 
in the likelihood function. The expected significance S is then obtained by performing a large 
number of pseudo-experiments and choosing the median value. The 1 and 2 a spreads in S 
are determined from the distribution of significances obtained from the pseudo-experiments. 



C. Exclusion limit 



We determine the exclusion limit, in the absence of a signal, by setting an upper limit on 



the yield, /, defined in Eq. (53). For a particular choice of Higgs mass rhh, we define a pdf 



/ by considering the likelihood Cg+b function of /, 

Pif) = /-^^^^'^'^^^ _ . (56) 
Jo Cs+b{N,f,mh) df 

The 95% confidence level limit on / for a given set of data is given by a as follows: 

/%(/)# = 0.95. (57) 

We then translate a into a 95% confidence level upper limit on the Higgs production cross 
section by unfolding with the detector acceptances and efficiencies. The expected exclusion 
limit is obtained by performing a large number of pseudo-experiments. 

It should be noted that the procedure above for setting the exclusion limit only takes into 
account differences in the shape of kinematic distributions between signal and background. 
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In particular we are mainly interested in possible improvements by including angular distri- 
bution in addition to the invariant mass spectrum. Typically experimental collaborations 
set limits directly on the normalization of the signal cross section by performing counting 
experiments in a particular window of total invariant mass. In comparison with the standard 
CLs method employed by ATLAS and CMS collaborations, our method should be consid- 
ered as a shortcut for the purpose of understanding the improvement from incorporating 
the angular correlations. One would hope to incorporate both the counting experiments and 
shape measurements in a more complete study. 



D. Probability density functions 

In this subsection we define the signal and background pdfs that enter into the likelihood 



function in Eq. (52). The kinematic observables are x = {Y, s,m1,ml,Q}, where Y is the 
pseudo-rapidity of the ZZ^*^ system, s is the partonic center-of-mass energy, mi(2) is the 
invariant mass of the ^1(2) boson, and Q represents the production and decay angles. 
The signal pdf is 

Psimh, x) = 1 das{m,;x) 
* ' egCTs (iTLh) dY ds dm\ dm"^ dVt ' 

where (Ts{mh) is the total hadronic cross section, das is the corresponding differential cross 
section, and is the total signal efficiency (which in principle includes geometric acceptance 
as well as reconstruction efficiencies). More explicitly, 

p . X _ 1 f fgixi)fg{x2) \ dahiTnh,s,Tni,m2,il) . , 



escrs(m/j) \ s J dm\dm\dVL 

In the above, ah is the partonic cross section, and the fg{x) is the gluon PDF. 

For the background pdf we take into account the fact that, in a hadron collider, we are 
unable to determine the direction of the initial quark (as opposed to the anti-quark) on an 
event-by-event basis by calculating the cross section for each choice of initial quark direction 
and summing the two. This may be written as 

p [x] = ^ ( ( /g(^i)/g(^2) A daqq{s,mi,m2,Vt) ^ ( fqixi)fq{x2) \ dagg{s, mi, ma, fi' 



ef,agq W s J dm'f dm'2 dVL ' \ s J drnf dm^ dVt' 

(60) 

where fi' = (tt - 6,^1,^2, 01 + TT, 02 + tt) is the shift in angles needed for an initial quark 
in the —z direction and we have switched the quark PDF with the anti-quark PDF (or 
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FIG. 5: Effect of detector resolution in measuring E and pT on kinematic variables. Each of the 
histograms were generated using twenty thousand events. The channel used for these plots is the 
background 2e2^ channel. 

equivalently switched xi and X2). The total qq — t- ZZ* — )► 4£ cross section is given by agq 
with the function representing the quark (anti-quark) PDF, and e?, the total efficiency 
for this channel. In calculating the cross section, a^g, for the qq — ?■ ZZ^*^ — )■ 4£ background, 
we sum over the quark flavors, u, d, s and c. We use CTEQ5L for the PDFs indicated in 



Eq. (59) and Eq. (60) as fi{x) 0\. 



V. MONTE CARLO SIMULATIONS 



We generate the signal and background events for our analysis using Mad- 
Graph/ MadEvent (MG/ME) version 4.4.52 [30]. Proton-proton collisions at ^/s = 7 TeV 
are implemented with the CTEQ5L [29] PDFs. As noted above, the main irreducible back- 
ground, and the only one which we will consider in our analysis, is qq — )■ ZZ^*^ — )■ 4£. For 
signal we consider only gg ^ h ^ ZZ'^*^ — > 4£, without additional jets in the final states; 
effectively our analysis considers only the "0-jet bin" for the ZZ'^*'^ — > 4£ channel. 
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Detector effects are modeled by applying Gaussian smearing to the energy of electrons 
according to 

and to the px of muons according to 

ap^^^ = 0.015 pt - 5.710'^ p^ + 2.210"^ (62) 



These expressions follow the CMS TDR pi]. The value of the constant term in Eq. (61 ) may 
be somewhat optimistic. However, we verified that our results do not change significantly 
when using a value for this quantity that is twice as large. The smearing of the muon pt 
is quite conservative as we have used the resolution corresponding to 1.8 < \ri\ < 2.0 and 
assumed this to be the same for lower values of 1] as well. The angles measured in the lab 
frame are not smeared. Nevertheless, angles which define the kinematics of the four lepton 
system, as described in Sect. [TT| are affected by the E and pt resolution, as can be seen 
in Fig. |5| A more sophisticated treatment of detector effects would include the effects of 
reconstruction efficiencies, which we neglect for simplicity. 

Another effect of smearing the lepton E or p^ is that the four lepton system will now 
have a small pt in the lab frame, even without the presence of additional jets or particles in 
the final states. We simply boost away this induced px and proceed to define angles as in 
Section HIl 

After generating events and smearing the lepton px, we apply the following cuts to each 
of the lepton in the lab frame: 

\Pt\ > 10 GeV , 
\v\ < 2.5 . 

Moreover, we focus on the following window for the Ai invariant mass 

150 GeV < s < 450 GeV . (63) 

The efficiency (geometric acceptance) for our selection of signal events is listed in Table [l] 
and varies from ~ 0.5 to ~ 0.7 as we increase the mass of the Higgs boson from 170 GeV 
to 350 GeV. The efficiency for selection of background events is 0.52. Moreover, these cuts 
affect the reconstructed angular distributions. As can be seen in Fig. |6| the distributions 
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m/,(GeV) 


a(fb) 


e 


{N) 




175 


0.218 


0.512 


0.279 




200 


1.26 


0.594 


1.87 


Signal 


220 


1.16 


0.625 


1.81 




250 


0.958 


0.654 


1.57 




300 


0.714 


0.701 


1.25 




350 


0.600 


0.708 


1.06 


Background 




8.78 


0.519 


11.4 



TABLE I: Expected number of events {N) for the signal and background 2e2/i channel at 2.5fb~^ . 
The SM cross section without any cuts is denoted by a, while e is the efficiency of our analysis 
cuts. The 4^ o,nd 4/i channel yields are half of the 2e2// channel and their efficiencies are the same 
as the 2e2/i channel. 

with respect to 6, 0i and 02 are modified most significantly by our acceptance cuts. In 
the case of the B distribution it is easy to see that the loss of events in the forward region 
due to pt and rapidity cuts cause a change in the shape of the distribution. The 0i and 02 
distributions undergo less trivial modifications. 

Also listed in Table [T] are the cross sections and expected number of events after selection 
cuts, with an integrated luminosity of 2.5 fb^^, for both signal and background at the LHC 
with ^/s = 7 TeV. The cross sections for the signal and background processes are obtained 
from MG/ME with K-factors applied. As we are considering the 0-jet bin for signal and 
background, there are a number of complications in determining the appropriate higher order 
cross section [32], a complete study of which is beyond the scope of this paper. Therefore 
we simply apply a i^'-factor of 1.5 for the signal and a i^'-factor of 1.33 for the background. 
(The i^-factor for the background is the value obtained in Ref. [331.) note that these 
i^-factors, are not used in the analyses themselves, but only in determining the number of 
events of each type to include in the pseudo-experiments. 
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FIG. 6: The effect on cos0 and (f)i distributions from rapidity and transverse momentum cuts on 
final state leptons. The panels on the left show the distribution for background events before (red) 
and after (blue) the cuts. The panels on the right show the distributions for signal events before 
(blue) and after (red) the cuts. Signal distributions correspond to a SM Higgs boson with ruh = 300 
GeV. 

VI. RESULTS 



In this section we present our results for the expected significance and expected exclusion 
limits from the MEM, which includes the following kinematic variables for the four lepton 
final state, 

X = {Y, s, ml, ml, 9, 6*1, 6*2, <^i, </)2} • (64) 

For comparison we also present results from an analogous analysis, which uses only invariant 
mass information in the pdf. (Specifically we consider the product of Breit-Wigner factors 
for the Higgs as our signal pdf and use the invariant mass distribution from simulated data as 
the background pdf.) Our results are for the LHC at ^/s = 7 TeV and integrated luminosities 
of 2.5, 5, and 7.5 fb~^ for discovery and 1, 2.5, and 5 fb^^ for exclusion. 
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A. Expected significance 



To compute the expected significance, we perform ten tfiousand pseudo-experiments 
(eacli) for Higgs masses of 175, 200, 220, 250, 300, and 350 GeV. Eacfi pseudo-experiment 
consists of signal and background events witli 4/i, 2e2/i, and 4e final states; the number of 
events of each type are chosen from Poisson distributions, where the expected numbers of 
signal and background events are given by the product of the luminosity under considera- 
tion, the theoretical cross section, and acceptance efficiencies after smearing and cuts. The 
cross section after cuts is also used to normalize the signal and background pdfs Pg and 



Pb in the likelihood function in Eq. (52). As we do not include the different reconstruction 
efficiencies for electrons and muons in our analysis, Pg and Pb are identical for 4/i, 2e2/x, and 
4e final states. However, we do consider the yields in each channel as separate parameters 
when finding the significance from pseudo-experiments by maximizing the likelihood with 
respect to the undetermined parameters. 

The median values of significance obtained for each Higgs mass in this channel, using 
both the MEM (all kinematic variables) and the likelihood method (invariant mass only), 
are shown in Fig. [7| The la and 2cr bands on the significance obtained from the MEM 
are also shown in this figure. We do not show the corresponding bands for the invariant 
mass-based analysis; the widths of the bands for this case are similar. 

We note that the MEM consistently outperforms the invariant mass only method, and 
that the effect is more pronounced, on the order of 10 - 20 %, at higher Higgs masses. This 
is because the helicity amplitudes for h — ?■ ZZ^*^ and qq — t- ZZ^*^ are increasingly different 
at larger values of invariant mass. More specifically, as already pointed out in Sect. |HI[ 
the (Ai,A2) = (±l,=l=l) amplitudes dominate in the high mass case for qq, while only the 
(Ai, A2) = (0,0) amplitude survives in the heavy Higgs limit in the h — )■ ZZ^*^ case. 

B. Exclusion 

In Figure [sj we show the median 95% confidence level exclusions on the yield parameter 
/, both as obtained with the MEM and as obtained with a likelihood method using only 
invariant mass information. These limits were obtained using the procedure described in 



Subsection IV C to find the 95% confidence limit on / in a given pseudo-experiment consist- 
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Integrated Luminosity: 2.5fb 




200 250 300 350 

nih (GeV) 

Integrated Luminosity: 5fb~^ 
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200 250 300 350 

nih (GeV) 

Integrated Luminosity: 7.5fb~^ 
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2- 



200 250 300 350 

nih (GeV) 

FIG. 7: Comparison of significance for Higgs discovery obtained from using only invariant mass 
information (brown dotted line) with that obtained from all of the kinematic variables using the 
MEM (black solid line) at different integrated luminosities. The green and yellow bands correspond 
to less than 1 and 2 a deviations from the median expected significance in the case where we use 
all kinematic variables. 
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FIG. 8: The panels on the left contain 95% confidence level exclusion limits on the Higgs signal 
yield, f, while the panels on the right contain these limits interpreted as limits on the Standard 
Model Higgs cross section. The exclusions represented by the brown, dashed line were obtained 
using only the invariant mass distribution, while all of the kinematic variables were used (via the 
MEM) in obtaining those exclusions represented by the black solid line. The green and yellow 
bands correspond to less than 1 and 2 a deviations from the median expected exclusion in the case 
where we use all kinematic variables. These results were obtained by performing one thousand 
pseudo-experiments. 
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ing only of background events, which were generated, as in our investigation of significance, 
using the method described in Section |Vj We note that in obtaining these hmits, the yield, 
/, was fixed to be the same for each of the three channels (4e, 2e2/i, and 4/x), and one 
thousand pseudo-experiments were performed. 

We can translate a limit on the yield to a limit on the Higgs production cross section, 
since the yield is defined as the ratio of the expected number of signal events to the expected 
number of signal and background events. As this is the expected number of events after the 
selection cuts are applied, it is proportional to the cross section times efficiency, rather than 
simply the cross section. Specifically, we find 

^s=('A(-^)cr„ (65) 



where es(b) and as{b) are the signal (background) efficiency and cross section respectively. 



Eq. (65) allows us to translate our limits on / into a limit on as, the Higgs cross section, as 



is done in Fig. [8] For simplicity we do not include any systematic uncertainties (e.g. on the 
efficiencies) in the analysis. We see that, as in the investigation of significance considered 
above, the difference in sensitivity between the MEM and the invariant-mass-only analysis 
is greater in the higher Higgs mass range. 



VII. CONCLUSIONS 

Traditional search strategies for the Higgs boson in the ZZ^*^ — )■ M channel concentrate 
only on measurements of the total invariant mass of the four leptons. Since the four-momenta 
of all final state particles can be reconstructed, we have considered in this work the possibility 
of including all available kinematic information in discriminating the Higgs signal from the 
dominant irreducible background at the LHC with ^/s = 7 TeV, by implementing the Matrix 
Element Method. 

We first derived Lorentz-invariant expressions for the production and decay angles used 
in the analyses, which allow for reconstruction of these angular variables from momenta 
measured in the laboratory frame. Then we presented analytic expressions for the fully 
differential distributions for both signal and background, allowing the Z bosons to be off- 
shell. 

We found that greater sensitivity in discovering or excluding the Standard Model Higgs 
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boson can be achieved when including spin correlations in addition to total invariant mass 
measurements. Generally, these improvements are on the order of 10 - 20 %; they are 
larger for higher Higgs masses, for which the differences between signal and background 
distributions are greater. 

Searching for the Higgs boson is among the top priorities of the physics program at 
the LHC Our results indicate it would be worthwhile to include full angular distributions 
in actual experimental searches in the golden channel through some type of multivariate 
analyses. Whether a Standard Model Higgs boson will be discovered or excluded in the near 
future, we eagerly await the result! 
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